Overview
Brought to you by YData
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 1338 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 156.9 KiB |
| Average record size in memory | 120.1 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 6 |
Anual_Salary is highly overall correlated with Hospital_expenditure and 5 other fields | High correlation |
Hospital_expenditure is highly overall correlated with Anual_Salary and 4 other fields | High correlation |
NUmber_of_past_hospitalizations is highly overall correlated with Anual_Salary and 4 other fields | High correlation |
age is highly overall correlated with charges and 1 other fields | High correlation |
charges is highly overall correlated with Anual_Salary and 6 other fields | High correlation |
num_of_steps is highly overall correlated with Anual_Salary and 6 other fields | High correlation |
past_consultations is highly overall correlated with Anual_Salary and 3 other fields | High correlation |
smoker_yes is highly overall correlated with Anual_Salary and 5 other fields | High correlation |
children has 574 (42.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-10-27 17:51:55.396723 |
|---|---|
| Analysis finished | 2025-10-27 17:52:07.080559 |
| Duration | 11.68 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
age
Real number (ℝ)
High correlation
| Distinct | 47 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.307922 |
| Minimum | 18 |
|---|---|
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 27 |
| median | 39 |
| Q3 | 51 |
| 95-th percentile | 62 |
| Maximum | 64 |
| Range | 46 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 13.987523 |
|---|---|
| Coefficient of variation (CV) | 0.35584489 |
| Kurtosis | -1.2299762 |
| Mean | 39.307922 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.046115549 |
| Sum | 52594 |
| Variance | 195.65081 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 69 | 5.2% |
| 19 | 66 | 4.9% |
| 39 | 34 | 2.5% |
| 47 | 29 | 2.2% |
| 48 | 29 | 2.2% |
| 45 | 29 | 2.2% |
| 46 | 29 | 2.2% |
| 52 | 29 | 2.2% |
| 50 | 29 | 2.2% |
| 51 | 29 | 2.2% |
| Other values (37) | 966 |
| Value | Count | Frequency (%) |
| 18 | 69 | |
| 19 | 66 | |
| 20 | 28 | |
| 21 | 28 | |
| 22 | 26 | 1.9% |
| 23 | 27 | 2.0% |
| 24 | 28 | |
| 25 | 27 | 2.0% |
| 26 | 28 | |
| 27 | 28 |
| Value | Count | Frequency (%) |
| 64 | 22 | |
| 63 | 23 | |
| 62 | 23 | |
| 61 | 23 | |
| 60 | 23 | |
| 59 | 25 | |
| 58 | 25 | |
| 57 | 26 | |
| 56 | 26 | |
| 55 | 26 |
bmi
Real number (ℝ)
| Distinct | 547 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.664518 |
| Minimum | 15.96 |
|---|---|
| Maximum | 53.13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 15.96 |
|---|---|
| 5-th percentile | 21.256 |
| Q1 | 26.315 |
| median | 30.4 |
| Q3 | 34.65625 |
| 95-th percentile | 41.106 |
| Maximum | 53.13 |
| Range | 37.17 |
| Interquartile range (IQR) | 8.34125 |
Descriptive statistics
| Standard deviation | 6.0948532 |
|---|---|
| Coefficient of variation (CV) | 0.19875914 |
| Kurtosis | -0.045394383 |
| Mean | 30.664518 |
| Median Absolute Deviation (MAD) | 4.18 |
| Skewness | 0.28449493 |
| Sum | 41029.125 |
| Variance | 37.147236 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32.3 | 13 | 1.0% |
| 28.31 | 9 | 0.7% |
| 31.35 | 8 | 0.6% |
| 30.875 | 8 | 0.6% |
| 30.8 | 8 | 0.6% |
| 34.1 | 8 | 0.6% |
| 30.495 | 8 | 0.6% |
| 28.88 | 8 | 0.6% |
| 24.32 | 7 | 0.5% |
| 32.775 | 7 | 0.5% |
| Other values (537) | 1254 |
| Value | Count | Frequency (%) |
| 15.96 | 1 | 0.1% |
| 16.815 | 2 | |
| 17.195 | 1 | 0.1% |
| 17.29 | 3 | |
| 17.385 | 1 | 0.1% |
| 17.4 | 1 | 0.1% |
| 17.48 | 1 | 0.1% |
| 17.67 | 1 | 0.1% |
| 17.765 | 1 | 0.1% |
| 17.8 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 53.13 | 1 | |
| 52.58 | 1 | |
| 50.38 | 1 | |
| 49.06 | 1 | |
| 48.07 | 1 | |
| 47.74 | 1 | |
| 47.6 | 1 | |
| 47.52 | 1 | |
| 47.41 | 1 | |
| 46.75 | 1 |
children
Real number (ℝ)
Zeros
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0904335 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 574 |
| Zeros (%) | 42.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.199619 |
|---|---|
| Coefficient of variation (CV) | 1.1001304 |
| Kurtosis | 0.19055935 |
| Mean | 1.0904335 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.93462317 |
| Sum | 1459 |
| Variance | 1.4390857 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 574 | |
| 1 | 326 | |
| 2 | 240 | |
| 3 | 156 | 11.7% |
| 4 | 25 | 1.9% |
| 5 | 17 | 1.3% |
| Value | Count | Frequency (%) |
| 0 | 574 | |
| 1 | 326 | |
| 2 | 240 | |
| 3 | 156 | 11.7% |
| 4 | 25 | 1.9% |
| 5 | 17 | 1.3% |
| Value | Count | Frequency (%) |
| 5 | 17 | 1.3% |
| 4 | 25 | 1.9% |
| 3 | 156 | 11.7% |
| 2 | 240 | |
| 1 | 326 | |
| 0 | 574 |
Claim_Amount
Real number (ℝ)
| Distinct | 1325 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33364.874 |
| Minimum | 1920.1363 |
|---|---|
| Maximum | 77277.988 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 1920.1363 |
|---|---|
| 5-th percentile | 8327.4096 |
| Q1 | 20947.645 |
| median | 33700.311 |
| Q3 | 44978.873 |
| 95-th percentile | 57500.208 |
| Maximum | 77277.988 |
| Range | 75357.852 |
| Interquartile range (IQR) | 24031.228 |
Descriptive statistics
| Standard deviation | 15535.346 |
|---|---|
| Coefficient of variation (CV) | 0.46561979 |
| Kurtosis | -0.69136873 |
| Mean | 33364.874 |
| Median Absolute Deviation (MAD) | 11950.315 |
| Skewness | 0.098037984 |
| Sum | 44642202 |
| Variance | 2.4134696 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33700.31068 | 14 | 1.0% |
| 34007.66739 | 1 | 0.1% |
| 34827.57407 | 1 | 0.1% |
| 27682.7277 | 1 | 0.1% |
| 12770.13249 | 1 | 0.1% |
| 15073.0378 | 1 | 0.1% |
| 56840.23864 | 1 | 0.1% |
| 28359.89423 | 1 | 0.1% |
| 37441.40236 | 1 | 0.1% |
| 8819.797107 | 1 | 0.1% |
| Other values (1315) | 1315 |
| Value | Count | Frequency (%) |
| 1920.136268 | 1 | |
| 2912.590584 | 1 | |
| 3037.725919 | 1 | |
| 3370.398323 | 1 | |
| 3768.603033 | 1 | |
| 3830.10039 | 1 | |
| 3927.892067 | 1 | |
| 3965.606464 | 1 | |
| 4149.950486 | 1 | |
| 4203.109713 | 1 |
| Value | Count | Frequency (%) |
| 77277.98848 | 1 | |
| 76028.85348 | 1 | |
| 73894.36704 | 1 | |
| 72760.92979 | 1 | |
| 72147.09404 | 1 | |
| 71885.17447 | 1 | |
| 71776.98023 | 1 | |
| 71219.0483 | 1 | |
| 70963.84723 | 1 | |
| 69927.51664 | 1 |
past_consultations
Real number (ℝ)
High correlation
| Distinct | 39 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.215247 |
| Minimum | 1 |
|---|---|
| Maximum | 40 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9 |
| median | 15 |
| Q3 | 20 |
| 95-th percentile | 29 |
| Maximum | 40 |
| Range | 39 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 7.450962 |
|---|---|
| Coefficient of variation (CV) | 0.48970367 |
| Kurtosis | -0.17971368 |
| Mean | 15.215247 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.41536423 |
| Sum | 20358 |
| Variance | 55.516835 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 70 | 5.2% |
| 15 | 70 | 5.2% |
| 10 | 68 | 5.1% |
| 21 | 68 | 5.1% |
| 18 | 68 | 5.1% |
| 13 | 66 | 4.9% |
| 9 | 63 | 4.7% |
| 16 | 61 | 4.6% |
| 19 | 60 | 4.5% |
| 17 | 58 | 4.3% |
| Other values (29) | 686 |
| Value | Count | Frequency (%) |
| 1 | 2 | 0.1% |
| 2 | 23 | 1.7% |
| 3 | 24 | 1.8% |
| 4 | 35 | |
| 5 | 45 | |
| 6 | 38 | |
| 7 | 54 | |
| 8 | 56 | |
| 9 | 63 | |
| 10 | 68 |
| Value | Count | Frequency (%) |
| 40 | 1 | 0.1% |
| 38 | 3 | 0.2% |
| 37 | 1 | 0.1% |
| 36 | 4 | 0.3% |
| 35 | 7 | |
| 34 | 4 | 0.3% |
| 33 | 4 | 0.3% |
| 32 | 13 | |
| 31 | 13 | |
| 30 | 13 |
num_of_steps
Real number (ℝ)
High correlation
| Distinct | 1335 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 910014.33 |
| Minimum | 695430 |
|---|---|
| Maximum | 1107872 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 695430 |
|---|---|
| 5-th percentile | 746831.1 |
| Q1 | 847489.75 |
| median | 914300 |
| Q3 | 971510 |
| 95-th percentile | 1062834.1 |
| Maximum | 1107872 |
| Range | 412442 |
| Interquartile range (IQR) | 124020.25 |
Descriptive statistics
| Standard deviation | 91783.198 |
|---|---|
| Coefficient of variation (CV) | 0.10085907 |
| Kurtosis | -0.61835488 |
| Mean | 910014.33 |
| Median Absolute Deviation (MAD) | 63276.5 |
| Skewness | -0.082096436 |
| Sum | 1.2175992 × 109 |
| Variance | 8.4241555 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 914300 | 4 | 0.3% |
| 949238 | 1 | 0.1% |
| 954363 | 1 | 0.1% |
| 946475 | 1 | 0.1% |
| 959985 | 1 | 0.1% |
| 946598 | 1 | 0.1% |
| 943649 | 1 | 0.1% |
| 948296 | 1 | 0.1% |
| 943900 | 1 | 0.1% |
| 952103 | 1 | 0.1% |
| Other values (1325) | 1325 |
| Value | Count | Frequency (%) |
| 695430 | 1 | |
| 699157 | 1 | |
| 699159 | 1 | |
| 700250 | 1 | |
| 701227 | 1 | |
| 702341 | 1 | |
| 704425 | 1 | |
| 706423 | 1 | |
| 706796 | 1 | |
| 711546 | 1 |
| Value | Count | Frequency (%) |
| 1107872 | 1 | |
| 1106821 | 1 | |
| 1100328 | 1 | |
| 1095960 | 1 | |
| 1092005 | 1 | |
| 1091279 | 1 | |
| 1091267 | 1 | |
| 1086635 | 1 | |
| 1086594 | 1 | |
| 1085496 | 1 |
Hospital_expenditure
Real number (ℝ)
High correlation
| Distinct | 1335 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15816825 |
| Minimum | 29452.533 |
|---|---|
| Maximum | 2.616317 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 29452.533 |
|---|---|
| 5-th percentile | 1258906.9 |
| Q1 | 4084941 |
| median | 7490336.9 |
| Q3 | 10826298 |
| 95-th percentile | 77451060 |
| Maximum | 2.616317 × 108 |
| Range | 2.6160225 × 108 |
| Interquartile range (IQR) | 6741357.2 |
Descriptive statistics
| Standard deviation | 26656991 |
|---|---|
| Coefficient of variation (CV) | 1.6853566 |
| Kurtosis | 18.935944 |
| Mean | 15816825 |
| Median Absolute Deviation (MAD) | 3398346 |
| Skewness | 3.7534547 |
| Sum | 2.1162912 × 1010 |
| Variance | 7.1059515 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7490336.905 | 4 | 0.3% |
| 12144973.48 | 1 | 0.1% |
| 5233108.484 | 1 | 0.1% |
| 4895703.017 | 1 | 0.1% |
| 2921439.006 | 1 | 0.1% |
| 8228465.305 | 1 | 0.1% |
| 2289506.112 | 1 | 0.1% |
| 7290091.666 | 1 | 0.1% |
| 6113973.771 | 1 | 0.1% |
| 8658220.438 | 1 | 0.1% |
| Other values (1325) | 1325 |
| Value | Count | Frequency (%) |
| 29452.53296 | 1 | |
| 35822.43757 | 1 | |
| 57647.08834 | 1 | |
| 77956.02763 | 1 | |
| 87483.3732 | 1 | |
| 87572.09263 | 1 | |
| 104079.8371 | 1 | |
| 174710.7356 | 1 | |
| 187778.4332 | 1 | |
| 249159.4253 | 1 |
| Value | Count | Frequency (%) |
| 261631699.3 | 1 | |
| 252892382.6 | 1 | |
| 223644981.3 | 1 | |
| 201515184.8 | 1 | |
| 170380500.5 | 1 | |
| 148034634.6 | 1 | |
| 144061589.9 | 1 | |
| 126353660.6 | 1 | |
| 123627927 | 1 | |
| 122405879.8 | 1 |
NUmber_of_past_hospitalizations
Categorical
High correlation
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.1 KiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 0.0 | |
| 3.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 959 | |
| 2.0 | 227 | 17.0% |
| 0.0 | 150 | 11.2% |
| 3.0 | 2 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 959 | |
| 2.0 | 227 | 17.0% |
| 0.0 | 150 | 11.2% |
| 3.0 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 1338 | |
| 1 | 959 | |
| 2 | 227 | 5.7% |
| 3 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 1338 | |
| 1 | 959 | |
| 2 | 227 | 5.7% |
| 3 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 1338 | |
| 1 | 959 | |
| 2 | 227 | 5.7% |
| 3 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 1338 | |
| 1 | 959 | |
| 2 | 227 | 5.7% |
| 3 | 2 | < 0.1% |
Anual_Salary
Real number (ℝ)
High correlation
| Distinct | 1333 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6866356 × 108 |
| Minimum | 2747071.9 |
|---|---|
| Maximum | 4.1171966 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 2747071.9 |
|---|---|
| 5-th percentile | 28048141 |
| Q1 | 77550855 |
| median | 1.4193609 × 108 |
| Q3 | 3.2252025 × 108 |
| 95-th percentile | 1.7417077 × 109 |
| Maximum | 4.1171966 × 109 |
| Range | 4.1144496 × 109 |
| Interquartile range (IQR) | 2.4496939 × 108 |
Descriptive statistics
| Standard deviation | 5.6581568 × 108 |
|---|---|
| Coefficient of variation (CV) | 1.5347752 |
| Kurtosis | 7.4130177 |
| Mean | 3.6866356 × 108 |
| Median Absolute Deviation (MAD) | 77679633 |
| Skewness | 2.6227419 |
| Sum | 4.9327185 × 1011 |
| Variance | 3.2014739 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 141936093.2 | 6 | 0.4% |
| 207558006.7 | 1 | 0.1% |
| 174967925.2 | 1 | 0.1% |
| 204185331.2 | 1 | 0.1% |
| 174312311.3 | 1 | 0.1% |
| 203539312.1 | 1 | 0.1% |
| 236017476 | 1 | 0.1% |
| 172852746.8 | 1 | 0.1% |
| 169599117.7 | 1 | 0.1% |
| 208172884.9 | 1 | 0.1% |
| Other values (1323) | 1323 |
| Value | Count | Frequency (%) |
| 2747071.908 | 1 | |
| 3150786.26 | 1 | |
| 3550883.601 | 1 | |
| 4038871.664 | 1 | |
| 4979935.53 | 1 | |
| 6062705.347 | 1 | |
| 6735551.472 | 1 | |
| 7086383.729 | 1 | |
| 7109737.472 | 1 | |
| 8140885.2 | 1 |
| Value | Count | Frequency (%) |
| 4117196637 | 1 | |
| 4006358505 | 1 | |
| 3640806683 | 1 | |
| 3484216117 | 1 | |
| 3101107370 | 1 | |
| 2780642185 | 1 | |
| 2682704509 | 1 | |
| 2489507977 | 1 | |
| 2463221852 | 1 | |
| 2446348418 | 1 |
charges
Real number (ℝ)
High correlation
| Distinct | 1337 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13270.422 |
| Minimum | 1121.8739 |
|---|---|
| Maximum | 63770.428 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 1121.8739 |
|---|---|
| 5-th percentile | 1757.7534 |
| Q1 | 4740.2872 |
| median | 9382.033 |
| Q3 | 16639.913 |
| 95-th percentile | 41181.828 |
| Maximum | 63770.428 |
| Range | 62648.554 |
| Interquartile range (IQR) | 11899.625 |
Descriptive statistics
| Standard deviation | 12110.011 |
|---|---|
| Coefficient of variation (CV) | 0.91255659 |
| Kurtosis | 1.6062987 |
| Mean | 13270.422 |
| Median Absolute Deviation (MAD) | 5018.7571 |
| Skewness | 1.5158797 |
| Sum | 17755825 |
| Variance | 1.4665237 × 108 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1639.5631 | 2 | 0.1% |
| 12949.1554 | 1 | 0.1% |
| 12928.7911 | 1 | 0.1% |
| 12925.886 | 1 | 0.1% |
| 12913.9924 | 1 | 0.1% |
| 12890.05765 | 1 | 0.1% |
| 12829.4551 | 1 | 0.1% |
| 12815.44495 | 1 | 0.1% |
| 12797.20962 | 1 | 0.1% |
| 12741.16745 | 1 | 0.1% |
| Other values (1327) | 1327 |
| Value | Count | Frequency (%) |
| 1121.8739 | 1 | |
| 1131.5066 | 1 | |
| 1135.9407 | 1 | |
| 1136.3994 | 1 | |
| 1137.011 | 1 | |
| 1137.4697 | 1 | |
| 1141.4451 | 1 | |
| 1146.7966 | 1 | |
| 1149.3959 | 1 | |
| 1163.4627 | 1 |
| Value | Count | Frequency (%) |
| 63770.42801 | 1 | |
| 62592.87309 | 1 | |
| 60021.39897 | 1 | |
| 58571.07448 | 1 | |
| 55135.40209 | 1 | |
| 52590.82939 | 1 | |
| 51194.55914 | 1 | |
| 49577.6624 | 1 | |
| 48970.2476 | 1 | |
| 48885.13561 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 676 | |
| 0.0 | 662 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 676 | |
| 0.0 | 662 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2000 | |
| . | 1338 | |
| 1 | 676 | 16.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2000 | |
| . | 1338 | |
| 1 | 676 | 16.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2000 | |
| . | 1338 | |
| 1 | 676 | 16.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2000 | |
| . | 1338 | |
| 1 | 676 | 16.8% |
smoker_yes
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.1 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1064 | |
| 1.0 | 274 | 20.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1064 | |
| 1.0 | 274 | 20.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2402 | |
| . | 1338 | |
| 1 | 274 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2402 | |
| . | 1338 | |
| 1 | 274 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2402 | |
| . | 1338 | |
| 1 | 274 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2402 | |
| . | 1338 | |
| 1 | 274 | 6.8% |
region_northwest
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.1 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1013 | |
| 1.0 | 325 | 24.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1013 | |
| 1.0 | 325 | 24.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
region_southeast
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.1 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 974 | |
| 1.0 | 364 | 27.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 974 | |
| 1.0 | 364 | 27.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2312 | |
| . | 1338 | |
| 1 | 364 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2312 | |
| . | 1338 | |
| 1 | 364 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2312 | |
| . | 1338 | |
| 1 | 364 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2312 | |
| . | 1338 | |
| 1 | 364 | 9.1% |
region_southwest
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.1 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1013 | |
| 1.0 | 325 | 24.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1013 | |
| 1.0 | 325 | 24.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4014 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2351 | |
| . | 1338 | |
| 1 | 325 | 8.1% |
Interactions
Correlations
| Anual_Salary | Claim_Amount | Hospital_expenditure | NUmber_of_past_hospitalizations | age | bmi | charges | children | num_of_steps | past_consultations | region_northwest | region_southeast | region_southwest | sex_male | smoker_yes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anual_Salary | 1.000 | 0.339 | 0.664 | 0.780 | 0.454 | 0.130 | 0.941 | 0.088 | 0.935 | 0.509 | 0.108 | 0.107 | 0.113 | 0.069 | 0.755 |
| Claim_Amount | 0.339 | 1.000 | 0.279 | 0.312 | 0.119 | 0.083 | 0.366 | 0.046 | 0.362 | 0.232 | 0.029 | 0.052 | 0.047 | 0.046 | 0.390 |
| Hospital_expenditure | 0.664 | 0.279 | 1.000 | 0.750 | 0.180 | 0.123 | 0.674 | 0.021 | 0.672 | 0.418 | 0.057 | 0.119 | 0.048 | 0.071 | 0.670 |
| NUmber_of_past_hospitalizations | 0.780 | 0.312 | 0.750 | 1.000 | 0.374 | 0.157 | 0.713 | 0.186 | 0.752 | 0.370 | 0.042 | 0.120 | 0.000 | 0.091 | 0.677 |
| age | 0.454 | 0.119 | 0.180 | 0.374 | 1.000 | 0.111 | 0.526 | 0.053 | 0.519 | 0.159 | 0.000 | 0.000 | 0.000 | 0.000 | 0.049 |
| bmi | 0.130 | 0.083 | 0.123 | 0.157 | 0.111 | 1.000 | 0.120 | 0.011 | 0.120 | 0.116 | 0.139 | 0.271 | 0.005 | 0.000 | 0.000 |
| charges | 0.941 | 0.366 | 0.674 | 0.713 | 0.526 | 0.120 | 1.000 | 0.135 | 0.993 | 0.519 | 0.000 | 0.097 | 0.086 | 0.063 | 0.832 |
| children | 0.088 | 0.046 | 0.021 | 0.186 | 0.053 | 0.011 | 0.135 | 1.000 | 0.134 | 0.059 | 0.045 | 0.000 | 0.000 | 0.000 | 0.037 |
| num_of_steps | 0.935 | 0.362 | 0.672 | 0.752 | 0.519 | 0.120 | 0.993 | 0.134 | 1.000 | 0.517 | 0.045 | 0.123 | 0.000 | 0.157 | 0.803 |
| past_consultations | 0.509 | 0.232 | 0.418 | 0.370 | 0.159 | 0.116 | 0.519 | 0.059 | 0.517 | 1.000 | 0.000 | 0.057 | 0.000 | 0.056 | 0.548 |
| region_northwest | 0.108 | 0.029 | 0.057 | 0.042 | 0.000 | 0.139 | 0.000 | 0.045 | 0.045 | 0.000 | 1.000 | 0.343 | 0.318 | 0.000 | 0.022 |
| region_southeast | 0.107 | 0.052 | 0.119 | 0.120 | 0.000 | 0.271 | 0.097 | 0.000 | 0.123 | 0.057 | 0.343 | 1.000 | 0.343 | 0.000 | 0.061 |
| region_southwest | 0.113 | 0.047 | 0.048 | 0.000 | 0.000 | 0.005 | 0.086 | 0.000 | 0.000 | 0.000 | 0.318 | 0.343 | 1.000 | 0.000 | 0.022 |
| sex_male | 0.069 | 0.046 | 0.071 | 0.091 | 0.000 | 0.000 | 0.063 | 0.000 | 0.157 | 0.056 | 0.000 | 0.000 | 0.000 | 1.000 | 0.069 |
| smoker_yes | 0.755 | 0.390 | 0.670 | 0.677 | 0.049 | 0.000 | 0.832 | 0.037 | 0.803 | 0.548 | 0.022 | 0.061 | 0.022 | 0.069 | 1.000 |
Missing values
Sample
| age | bmi | children | Claim_Amount | past_consultations | num_of_steps | Hospital_expenditure | NUmber_of_past_hospitalizations | Anual_Salary | charges | sex_male | smoker_yes | region_northwest | region_southeast | region_southwest | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 18.0 | 23.21 | 0.0 | 29087.543130 | 17.0 | 715428.0 | 4.720921e+06 | 0.0 | 5.578497e+07 | 1121.8739 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 1 | 18.0 | 30.14 | 0.0 | 39053.674370 | 7.0 | 699157.0 | 4.329832e+06 | 0.0 | 1.370089e+07 | 1131.5066 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 2 | 18.0 | 33.33 | 0.0 | 39023.627590 | 19.0 | 702341.0 | 6.884861e+06 | 0.0 | 7.352311e+07 | 1135.9407 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 3 | 18.0 | 33.66 | 0.0 | 28185.393320 | 11.0 | 700250.0 | 4.274774e+06 | 0.0 | 7.581968e+07 | 1136.3994 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 4 | 18.0 | 34.10 | 0.0 | 14697.859410 | 16.0 | 711584.0 | 3.787294e+06 | 0.0 | 2.301232e+07 | 1137.0110 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 5 | 18.0 | 34.43 | 0.0 | 26488.339120 | 20.0 | 717162.0 | 3.696161e+06 | 0.0 | 1.419361e+08 | 1137.4697 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 6 | 18.0 | 37.29 | 0.0 | 33217.365480 | 13.0 | 699159.0 | 8.765292e+05 | 0.0 | 6.906067e+07 | 1141.4451 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 7 | 18.0 | 41.14 | 0.0 | 46770.585330 | 12.0 | 706423.0 | 4.486741e+06 | 0.0 | 9.719378e+07 | 1146.7966 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 8 | 18.0 | 43.01 | 0.0 | 9715.650411 | 17.0 | 914300.0 | 9.216440e+06 | 0.0 | 5.888197e+07 | 1149.3959 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| 9 | 18.0 | 53.13 | 0.0 | 17046.585150 | 19.0 | 704425.0 | 1.458972e+06 | 0.0 | 9.426182e+07 | 1163.4627 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 |
| age | bmi | children | Claim_Amount | past_consultations | num_of_steps | Hospital_expenditure | NUmber_of_past_hospitalizations | Anual_Salary | charges | sex_male | smoker_yes | region_northwest | region_southeast | region_southwest | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1328 | 44.0 | 38.060 | 0.0 | 76028.85348 | 25.0 | 1072324.0 | 122405879.8 | 2.0 | 2.430290e+09 | 48885.13561 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 |
| 1329 | 59.0 | 41.140 | 1.0 | 53104.92621 | 38.0 | 1079931.0 | 126353660.6 | 2.0 | 2.399896e+09 | 48970.24760 | 1.0 | 1.0 | 0.0 | 1.0 | 0.0 |
| 1330 | 64.0 | 36.960 | 2.0 | 65641.24823 | 28.0 | 1091279.0 | 123627927.0 | 2.0 | 2.489508e+09 | 49577.66240 | 1.0 | 1.0 | 0.0 | 1.0 | 0.0 |
| 1331 | 28.0 | 36.400 | 1.0 | 55590.75527 | 26.0 | 1080113.0 | 144061589.9 | 2.0 | 2.682705e+09 | 51194.55914 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 1332 | 60.0 | 32.800 | 0.0 | 77277.98848 | 40.0 | 1095960.0 | 148034634.6 | 2.0 | 2.780642e+09 | 52590.82939 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 1333 | 33.0 | 35.530 | 0.0 | 63142.25346 | 32.0 | 1091267.0 | 170380500.5 | 2.0 | 3.101107e+09 | 55135.40209 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 |
| 1334 | 31.0 | 38.095 | 1.0 | 43419.95227 | 31.0 | 1107872.0 | 201515184.8 | 2.0 | 3.484216e+09 | 58571.07448 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 |
| 1335 | 52.0 | 34.485 | 3.0 | 52458.92353 | 25.0 | 1092005.0 | 223644981.3 | 2.0 | 3.640807e+09 | 60021.39897 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 |
| 1336 | 45.0 | 30.360 | 0.0 | 69927.51664 | 34.0 | 1106821.0 | 252892382.6 | 3.0 | 4.006359e+09 | 62592.87309 | 1.0 | 1.0 | 0.0 | 1.0 | 0.0 |
| 1337 | 54.0 | 47.410 | 0.0 | 63982.80926 | 31.0 | 1100328.0 | 261631699.3 | 3.0 | 4.117197e+09 | 63770.42801 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 |